Goto

Collaborating Authors

 sagemaker studio


Enabling MLOPs in Three Simple Steps

#artificialintelligence

I recently engaged in a project involving the implementation of a multiclass classification prediction system utilising financial transactional data, comprising over 10 million records and over 70 classes. Through this project, I constructed a streamlined end-to-end machine learning operations (MLOPs) infrastructure that is well-suited for this specific use case, while maintaining cost efficiency. The term MLOPs has a broad range of concepts and definitions, as offered by various vendors or solutions. Some focus on aspects such as training traceability and experimental tracking, while others prioritise feature storage or model deployment. In my understanding, MLOPs is the entire end-to-end process, from data extraction to model deployment and monitoring.


Set up Amazon SageMaker Studio with Jupyter Lab 3 using the AWS CDK

#artificialintelligence

Amazon SageMaker Studio is a fully integrated development environment (IDE) for machine learning (ML) partly based on JupyterLab 3. Studio provides a web-based interface to interactively perform ML development tasks required to prepare data and build, train, and deploy ML models. In Studio, you can load data, adjust ML models, move in between steps to adjust experiments, compare results, and deploy ML models for inference. The AWS Cloud Development Kit (AWS CDK) is an open-source software development framework to create AWS CloudFormation stacks through automatic CloudFormation template generation. A stack is a collection of AWS resources, that can be programmatically updated, moved, or deleted. AWS CDK constructs are the building blocks of AWS CDK applications, representing the blueprint to define cloud architectures.


Operationalize your Amazon SageMaker Studio notebooks as scheduled notebook jobs

#artificialintelligence

Amazon SageMaker Studio provides a fully managed solution for data scientists to interactively build, train, and deploy machine learning (ML) models. In addition to the interactive ML experience, data workers also seek solutions to run notebooks as ephemeral jobs without the need to refactor code as Python modules or learn DevOps tools and best practices to automate their deployment infrastructure. Previously, when data scientists wanted to take the code they built interactively on notebooks and run them as batch jobs, they were faced with a steep learning curve using Amazon SageMaker Pipelines, AWS Lambda, Amazon EventBridge, or other solutions that are difficult to set up, use, and manage. With SageMaker notebook jobs, you can now run your notebooks as is or in a parameterized fashion with just a few simple clicks from the SageMaker Studio or SageMaker Studio Lab interface. You can run these notebooks on a schedule or immediately.


An Introduction to Amazon SageMaker

#artificialintelligence

Amazon SageMaker helps data scientists and inventors to prepare, make, train, and deploy high- quality machine learning models by bringing together a broad set of capabilities purpose- erected for machine learning. Amazon SageMaker make available a set of solutions for the most common use cases that may be deployed readily with just a few clicks to make it easier to grow started. Amazon SageMaker is a completely accomplished machine learning service. Data scientists and developers may speedily and easily build and train machine learning models with SageMaker. They can straight deploy them into a production-ready hosted environment.


Automate ML Development With Amazon Sagemaker - Analytics Vidhya

#artificialintelligence

This article was published as a part of the Data Science Blogathon. Amazon Sagemaker is arguably the most powerful, feature-rich, and fully managed machine learning service developed by Amazon. From creating your own labeled datasets to deploying and monitoring the models on production, Sagemaker is equipped to do everything. It can also provide an integrated Jupyter notebook instance for easy access to your data for exploration and analysis, so you don't have to fiddle around with server configuration. Sagemaker supports bring-your-own-algorithms and frameworks, which offer flexible distributed training options that adjust to your specific workflows.


A Beginner's Guide to AutoML - Solita Data

#artificialintelligence

Automated Machine Learning (AutoML) is a concept that provides the means to utilise existing data and create models for non-Machine Learning experts. In addition to that, AutoML provides Machine Learning (ML) professionals ways to develop and use effective models without spending time on tasks such as data cleaning and preprocessing, feature engineering, model selection, hyperparameter tuning, etc. Before we move any further, it is important to note that AutoML is not some system that has been developed by a single entity. Several organisations have developed their own AutoML packages. These packages cover a broad area, and targets people at different skill levels.


3 + 1 ways of running R on Amazon SageMaker

#artificialintelligence

The R programming language is one of the most commonly used languages in the scientific space, being one of the most commonly used languages for machine learning (probably second following python) and arguably the most popular language amongst mathematicians and statisticians. It is easy to get started with, free to use, with support for many scientific and visualisation libraries. While R can help you analyse your data, the more data you have the more compute power you require and the more impactful your analysis is, the more repeatability and reproducibility is required. Analysts and Data Scientists need to find ways to fulfil such requirements. In this post we briefly describe the main ways of running your R workloads on the cloud, making use of Amazon SageMaker, the end-to-end Machine Learning cloud offering of AWS.


Exploring SageMaker Canvas

#artificialintelligence

Building Machine Learning models takes knowledge, experience, and a lot of time. Sometimes different persona such as Business Analysts or other technocrats who do not have experience with ML might have a ML use-case that they may want to address, but lack the expertise to do so. Even ML engineers and Data Scientists who have ML experience may want a model built quickly. This brings us to the domain of AutoML. Nowadays we're seeing a plethora of AutoML solutions from open source APIs to individual services/platforms geared for automating the ML space.


Make batch predictions with Amazon SageMaker Autopilot

#artificialintelligence

Amazon SageMaker Autopilot is an automated machine learning (AutoML) solution that performs all the tasks you need to complete an end-to-end machine learning (ML) workflow. It explores and prepares your data, applies different algorithms to generate a model, and transparently provides model insights and explainability reports to help you interpret the results. Autopilot can also create a real-time endpoint for online inference. You can access Autopilot's one-click features in Amazon SageMaker Studio or by using the AWS SDK for Python (Boto3) or the SageMaker Python SDK. In this post, we show how to make batch predictions on an unlabeled dataset using an Autopilot-trained model.


Top 12 AI and machine learning announcements at AWS re:Invent 2021

#artificialintelligence

This week during its re:Invent 2021 conference in Las Vegas, Amazon announced a slew of new AI and machine learning products and updates across its Amazon Web Services (AWS) portfolio. Touching on DevOps, big data, and analytics, among the highlights were a call summarization feature for Amazon Lex and a capability in CodeGuru that helps detect secrets in source code. Amazon's continued embrace of AI comes as enterprises express a willingness to pilot automation technologies in transitioning their businesses online. Fifty-two percent of companies accelerated their AI adoption plans because of the COVID pandemic, according to a PricewaterhouseCoopers study. Meanwhile, Harris Poll found that 55% of companies accelerated their AI strategy in 2020 and 67% expect to further accelerate their strategy in 2021.